Overview

Dataset statistics

Number of variables10
Number of observations214
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.5%
Total size in memory16.8 KiB
Average record size in memory80.6 B

Variable types

NUM10

Reproduction

Analysis started2020-08-30 15:35:09.585256
Analysis finished2020-08-30 15:35:26.058812
Versionpandas-profiling v2.5.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml
Dataset has 1 (0.5%) duplicate rows Duplicates
Mg has 42 (19.6%) zeros Zeros
K has 30 (14.0%) zeros Zeros
Ba has 176 (82.2%) zeros Zeros
Fe has 144 (67.3%) zeros Zeros

Variables

RI
Real number (ℝ≥0)

Distinct count178
Unique (%)83.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.518365421
Minimum1.51115
Maximum1.53393
Zeros0
Zeros (%)0.0%
Memory size1.8 KiB

Quantile statistics

Minimum1.51115
5-th percentile1.515401
Q11.5165225
median1.51768
Q31.5191575
95-th percentile1.523664
Maximum1.53393
Range0.02278
Interquartile range (IQR)0.002635

Descriptive statistics

Standard deviation0.003036863739
Coefficient of variation (CV)0.002000087527
Kurtosis4.931737386
Mean1.518365421
Median Absolute Deviation (MAD)0.002121332867
Skewness1.625430506
Sum324.9302
Variance9.222541372e-06
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1.51115 1.51511 1.515885 1.518515 1.52237 1.53393 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1.52152 3 1.4%
 
1.51645 3 1.4%
 
1.5159 3 1.4%
 
1.52213 2 0.9%
 
1.51763 2 0.9%
 
1.51779 2 0.9%
 
1.51769 2 0.9%
 
1.51793 2 0.9%
 
1.51613 2 0.9%
 
1.51618 2 0.9%
 
Other values (168) 191 89.3%
 
ValueCountFrequency (%) 
1.51115 1 0.5%
 
1.51131 1 0.5%
 
1.51215 1 0.5%
 
1.51299 1 0.5%
 
1.51316 1 0.5%
 
ValueCountFrequency (%) 
1.53393 1 0.5%
 
1.53125 1 0.5%
 
1.52777 1 0.5%
 
1.52739 1 0.5%
 
1.52725 1 0.5%
 

Na
Real number (ℝ≥0)

Distinct count142
Unique (%)66.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13.40785047
Minimum10.73
Maximum17.38
Zeros0
Zeros (%)0.0%
Memory size1.8 KiB

Quantile statistics

Minimum10.73
5-th percentile12.415
Q112.9075
median13.3
Q313.825
95-th percentile14.8535
Maximum17.38
Range6.65
Interquartile range (IQR)0.9175

Descriptive statistics

Standard deviation0.8166035557
Coefficient of variation (CV)0.06090488238
Kurtosis3.052232409
Mean13.40785047
Median Absolute Deviation (MAD)0.5988977203
Skewness0.4541814537
Sum2869.28
Variance0.6668413672
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[10.73 12.555 13.725 15.08 17.38 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
13 5 2.3%
 
13.02 5 2.3%
 
13.21 5 2.3%
 
12.85 4 1.9%
 
13.64 4 1.9%
 
13.24 4 1.9%
 
12.86 4 1.9%
 
13.33 4 1.9%
 
12.93 3 1.4%
 
13.2 3 1.4%
 
Other values (132) 173 80.8%
 
ValueCountFrequency (%) 
10.73 1 0.5%
 
11.02 1 0.5%
 
11.03 1 0.5%
 
11.23 1 0.5%
 
11.45 1 0.5%
 
ValueCountFrequency (%) 
17.38 1 0.5%
 
15.79 1 0.5%
 
15.15 1 0.5%
 
15.01 1 0.5%
 
14.99 1 0.5%
 

Mg
Real number (ℝ≥0)

ZEROS
Distinct count94
Unique (%)43.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.68453271
Minimum0
Maximum4.49
Zeros42
Zeros (%)19.6%
Memory size1.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q12.115
median3.48
Q33.6
95-th percentile3.85
Maximum4.49
Range4.49
Interquartile range (IQR)1.485

Descriptive statistics

Standard deviation1.442407845
Coefficient of variation (CV)0.5373031364
Kurtosis-0.4103189629
Mean2.68453271
Median Absolute Deviation (MAD)1.209406498
Skewness-1.152559318
Sum574.49
Variance2.080540391
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.165 1.66 3.335 3.465 3.625 3.915 4.49 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 42 19.6%
 
3.54 8 3.7%
 
3.48 8 3.7%
 
3.58 8 3.7%
 
3.52 7 3.3%
 
3.62 5 2.3%
 
3.5 4 1.9%
 
3.66 4 1.9%
 
3.61 4 1.9%
 
3.56 4 1.9%
 
Other values (84) 120 56.1%
 
ValueCountFrequency (%) 
0 42 19.6%
 
0.33 1 0.5%
 
0.78 1 0.5%
 
1.01 1 0.5%
 
1.35 1 0.5%
 
ValueCountFrequency (%) 
4.49 1 0.5%
 
3.98 1 0.5%
 
3.97 1 0.5%
 
3.93 1 0.5%
 
3.9 3 1.4%
 

Al
Real number (ℝ≥0)

Distinct count118
Unique (%)55.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.444906542
Minimum0.29
Maximum3.5
Zeros0
Zeros (%)0.0%
Memory size1.8 KiB

Quantile statistics

Minimum0.29
5-th percentile0.696
Q11.19
median1.36
Q31.63
95-th percentile2.394
Maximum3.5
Range3.21
Interquartile range (IQR)0.44

Descriptive statistics

Standard deviation0.4992696456
Coefficient of variation (CV)0.3455376739
Kurtosis2.060568969
Mean1.444906542
Median Absolute Deviation (MAD)0.359052319
Skewness0.907289809
Sum309.21
Variance0.249270179
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.29 1.105 1.635 2.11 3.5 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1.54 8 3.7%
 
1.19 6 2.8%
 
1.29 5 2.3%
 
1.43 5 2.3%
 
1.23 5 2.3%
 
1.56 5 2.3%
 
1.36 4 1.9%
 
1.35 4 1.9%
 
1.28 4 1.9%
 
1.62 3 1.4%
 
Other values (108) 165 77.1%
 
ValueCountFrequency (%) 
0.29 1 0.5%
 
0.34 1 0.5%
 
0.47 2 0.9%
 
0.51 1 0.5%
 
0.56 2 0.9%
 
ValueCountFrequency (%) 
3.5 1 0.5%
 
3.04 1 0.5%
 
3.02 1 0.5%
 
2.88 1 0.5%
 
2.79 1 0.5%
 

Si
Real number (ℝ≥0)

Distinct count133
Unique (%)62.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean72.65093458
Minimum69.81
Maximum75.41
Zeros0
Zeros (%)0.0%
Memory size1.8 KiB

Quantile statistics

Minimum69.81
5-th percentile71.315
Q172.28
median72.79
Q373.0875
95-th percentile73.5175
Maximum75.41
Range5.6
Interquartile range (IQR)0.8075

Descriptive statistics

Standard deviation0.7745457948
Coefficient of variation (CV)0.0106611952
Kurtosis2.967902956
Mean72.65093458
Median Absolute Deviation (MAD)0.5556956939
Skewness-0.7304472251
Sum15547.3
Variance0.5999211882
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[69.81 71.735 72.625 73.295 73.845 75.41 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
72.86 4 1.9%
 
73.28 4 1.9%
 
73.1 4 1.9%
 
72.99 4 1.9%
 
73.11 4 1.9%
 
72.97 3 1.4%
 
72.95 3 1.4%
 
73.01 3 1.4%
 
72.64 3 1.4%
 
72.96 3 1.4%
 
Other values (123) 179 83.6%
 
ValueCountFrequency (%) 
69.81 1 0.5%
 
69.89 1 0.5%
 
70.16 1 0.5%
 
70.26 1 0.5%
 
70.43 1 0.5%
 
ValueCountFrequency (%) 
75.41 1 0.5%
 
75.18 1 0.5%
 
74.55 1 0.5%
 
74.45 1 0.5%
 
73.88 1 0.5%
 

K
Real number (ℝ≥0)

ZEROS
Distinct count65
Unique (%)30.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.4970560748
Minimum0
Maximum6.21
Zeros30
Zeros (%)14.0%
Memory size1.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10.1225
median0.555
Q30.61
95-th percentile0.76
Maximum6.21
Range6.21
Interquartile range (IQR)0.4875

Descriptive statistics

Standard deviation0.6521918456
Coefficient of variation (CV)1.312109194
Kurtosis54.68969853
Mean0.4970560748
Median Absolute Deviation (MAD)0.2943628264
Skewness6.55164831
Sum106.37
Variance0.4253542034
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.01 0.165 0.535 0.615 0.695 0.785 6.21 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 30 14.0%
 
0.57 12 5.6%
 
0.6 11 5.1%
 
0.56 11 5.1%
 
0.58 10 4.7%
 
0.64 8 3.7%
 
0.61 8 3.7%
 
0.59 7 3.3%
 
0.54 6 2.8%
 
0.62 6 2.8%
 
Other values (55) 105 49.1%
 
ValueCountFrequency (%) 
0 30 14.0%
 
0.02 1 0.5%
 
0.03 1 0.5%
 
0.04 2 0.9%
 
0.05 1 0.5%
 
ValueCountFrequency (%) 
6.21 2 0.9%
 
2.7 1 0.5%
 
1.76 1 0.5%
 
1.68 1 0.5%
 
1.46 1 0.5%
 

Ca
Real number (ℝ≥0)

Distinct count143
Unique (%)66.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8.956962617
Minimum5.43
Maximum16.19
Zeros0
Zeros (%)0.0%
Memory size1.8 KiB

Quantile statistics

Minimum5.43
5-th percentile7.8125
Q18.24
median8.6
Q39.1725
95-th percentile11.5615
Maximum16.19
Range10.76
Interquartile range (IQR)0.9325

Descriptive statistics

Standard deviation1.423153487
Coefficient of variation (CV)0.1588879566
Kurtosis6.681977951
Mean8.956962617
Median Absolute Deviation (MAD)0.9181269106
Skewness2.047053913
Sum1916.79
Variance2.025365848
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 5.43 7.805 9.075 9.9 11.63 16.19 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
8.03 5 2.3%
 
8.43 5 2.3%
 
9.57 4 1.9%
 
8.44 4 1.9%
 
8.79 4 1.9%
 
8.38 3 1.4%
 
8.83 3 1.4%
 
8.67 3 1.4%
 
8.39 3 1.4%
 
8.53 3 1.4%
 
Other values (133) 177 82.7%
 
ValueCountFrequency (%) 
5.43 1 0.5%
 
5.79 1 0.5%
 
5.87 1 0.5%
 
6.47 1 0.5%
 
6.65 1 0.5%
 
ValueCountFrequency (%) 
16.19 1 0.5%
 
14.96 1 0.5%
 
14.68 1 0.5%
 
14.4 1 0.5%
 
13.44 1 0.5%
 

Ba
Real number (ℝ≥0)

ZEROS
Distinct count34
Unique (%)15.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.175046729
Minimum0
Maximum3.15
Zeros176
Zeros (%)82.2%
Memory size1.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1.57
Maximum3.15
Range3.15
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.4972192606
Coefficient of variation (CV)2.840494441
Kurtosis12.54108358
Mean0.175046729
Median Absolute Deviation (MAD)0.2923696393
Skewness3.416424569
Sum37.46
Variance0.2472269931
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.03 0.785 1.56 1.695 3.15 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 176 82.2%
 
1.57 2 0.9%
 
0.64 2 0.9%
 
0.09 2 0.9%
 
1.59 2 0.9%
 
0.11 2 0.9%
 
0.15 1 0.5%
 
1.55 1 0.5%
 
0.61 1 0.5%
 
0.63 1 0.5%
 
Other values (24) 24 11.2%
 
ValueCountFrequency (%) 
0 176 82.2%
 
0.06 1 0.5%
 
0.09 2 0.9%
 
0.11 2 0.9%
 
0.14 1 0.5%
 
ValueCountFrequency (%) 
3.15 1 0.5%
 
2.88 1 0.5%
 
2.2 1 0.5%
 
1.71 1 0.5%
 
1.68 1 0.5%
 

Fe
Real number (ℝ≥0)

ZEROS
Distinct count32
Unique (%)15.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.05700934579
Minimum0
Maximum0.51
Zeros144
Zeros (%)67.3%
Memory size1.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30.1
95-th percentile0.267
Maximum0.51
Range0.51
Interquartile range (IQR)0.1

Descriptive statistics

Standard deviation0.09743870064
Coefficient of variation (CV)1.709170651
Kurtosis2.662015617
Mean0.05700934579
Median Absolute Deviation (MAD)0.07748012927
Skewness1.75432747
Sum12.2
Variance0.009494300382
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.005 0.065 0.245 0.36 0.51 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 144 67.3%
 
0.17 7 3.3%
 
0.24 7 3.3%
 
0.09 6 2.8%
 
0.1 5 2.3%
 
0.11 4 1.9%
 
0.07 3 1.4%
 
0.14 3 1.4%
 
0.28 3 1.4%
 
0.16 3 1.4%
 
Other values (22) 29 13.6%
 
ValueCountFrequency (%) 
0 144 67.3%
 
0.01 1 0.5%
 
0.03 1 0.5%
 
0.05 1 0.5%
 
0.06 1 0.5%
 
ValueCountFrequency (%) 
0.51 1 0.5%
 
0.37 1 0.5%
 
0.35 1 0.5%
 
0.34 1 0.5%
 
0.32 1 0.5%
 

Type
Real number (ℝ≥0)

Distinct count6
Unique (%)2.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.780373832
Minimum1
Maximum7
Zeros0
Zeros (%)0.0%
Memory size1.8 KiB

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q33
95-th percentile7
Maximum7
Range6
Interquartile range (IQR)2

Descriptive statistics

Standard deviation2.103738646
Coefficient of variation (CV)0.7566387736
Kurtosis-0.2795182977
Mean2.780373832
Median Absolute Deviation (MAD)1.719014761
Skewness1.114915201
Sum595
Variance4.425716292
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1. 1.5 2.5 6.5 7. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2 76 35.5%
 
1 70 32.7%
 
7 29 13.6%
 
3 17 7.9%
 
5 13 6.1%
 
6 9 4.2%
 
ValueCountFrequency (%) 
1 70 32.7%
 
2 76 35.5%
 
3 17 7.9%
 
5 13 6.1%
 
6 9 4.2%
 
ValueCountFrequency (%) 
7 29 13.6%
 
6 9 4.2%
 
5 13 6.1%
 
3 17 7.9%
 
2 76 35.5%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Missing values

Sample

First rows

RINaMgAlSiKCaBaFeType
01.5210113.644.491.1071.780.068.750.00.001
11.5176113.893.601.3672.730.487.830.00.001
21.5161813.533.551.5472.990.397.780.00.001
31.5176613.213.691.2972.610.578.220.00.001
41.5174213.273.621.2473.080.558.070.00.001
51.5159612.793.611.6272.970.648.070.00.261
61.5174313.303.601.1473.090.588.170.00.001
71.5175613.153.611.0573.240.578.240.00.001
81.5191814.043.581.3772.080.568.300.00.001
91.5175513.003.601.3672.990.578.400.00.111

Last rows

RINaMgAlSiKCaBaFeType
2041.5161714.950.02.2773.300.008.710.670.07
2051.5173214.950.01.8072.990.008.611.550.07
2061.5164514.940.01.8773.110.008.671.380.07
2071.5183114.390.01.8272.861.416.472.880.07
2081.5164014.370.02.7472.850.009.450.540.07
2091.5162314.140.02.8872.610.089.181.060.07
2101.5168514.920.01.9973.060.008.401.590.07
2111.5206514.360.02.0273.420.008.441.640.07
2121.5165114.380.01.9473.610.008.481.570.07
2131.5171114.230.02.0873.360.008.621.670.07